CDS

Accession Number TCMCG074C12032
gbkey CDS
Protein Id KAF8393639.1
Location complement(join(38642155..38642247,38643406..38643465,38644576..38644671,38645914..38645996,38646179..38646237,38650326..38650422,38651738..38651819,38651907..38652001,38655443..38655518))
Organism Tetracentron sinense
locus_tag HHK36_021885
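
The Location field above can be checked against the protein length reported further down in this record. The following is a minimal sketch, not part of the original record: the helper names `parse_location` and `spliced_length` are hypothetical, and the regular expression assumes a location made only of plain `start..end` segments (no partial markers such as `<` or `>`, and no single-base positions).

```python
import re


def parse_location(location: str):
    """Return (strand, [(start, end), ...]) for a simple GenBank-style location."""
    # A leading "complement(" means the feature lies on the reverse strand.
    strand = "-" if location.startswith("complement(") else "+"
    # Collect every "start..end" pair, in the order written in the record.
    segments = [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, segments


def spliced_length(segments) -> int:
    """Total number of bases covered by the segments (coordinates are inclusive)."""
    return sum(end - start + 1 for start, end in segments)


location = (
    "complement(join(38642155..38642247,38643406..38643465,"
    "38644576..38644671,38645914..38645996,38646179..38646237,"
    "38650326..38650422,38651738..38651819,38651907..38652001,"
    "38655443..38655518))"
)

strand, segments = parse_location(location)
cds_len = spliced_length(segments)

print(strand, len(segments), cds_len)  # - 9 741
# 246 amino acids plus one stop codon, three bases each
print(cds_len == 3 * (246 + 1))        # True
```

The nine segments sum to 741 bases, which agrees with a 246 aa protein followed by a stop codon.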

Protein

Length 246aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA625382, BioSample:SAMN14615867
db_source JABCRI010000015.1
Definition hypothetical protein HHK36_021885 [Tetracentron sinense]
Locus_tag HHK36_021885

EGGNOG-MAPPER Annotation

COG_category O
Description The proteasome is a multicatalytic proteinase complex which is characterized by its ability to cleave peptides with Arg, Phe, Tyr, Leu, and Glu adjacent to the leaving group at neutral or slightly basic pH
KEGG_TC -
KEGG_Module M00337, M00340
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001, ko00002, ko01000, ko01002, ko03051
KEGG_ko ko:K02730
EC 3.4.25.1
KEGG_Pathway ko03050, map03050
GOs GO:0000502, GO:0005575, GO:0005622, GO:0005623, GO:0005839, GO:0019773, GO:0032991, GO:0044424, GO:0044464, GO:1902494, GO:1905368, GO:1905369

Sequence

CDS:  
ATGAGTCGTGGAGCTGGAGCTGGCTACGATCGTCACATTACCATTTTTTCTCCCGAAGGTCGTCTGTTCCAAGTCGAGTATGCTTTTAAGGCTGTGAAGGCTTCTGGGATCACCTCGATTGGTGTCCGAGGAAAGGACACGGTCTGTGTTGTCACACAAAAGAAGGTTCCGGACAAGCTTTTGGATCAGACTAGTGTTACACATCTGTTTCCCATTACAAAGTACCTCGGATTGTTAGCCACGGGAATGACAGCTGATGCGAGGACCTTGGTCCAACAAGCAAGGAGTGAAGCAGCTGAGTTTCGTTTCAGATATGGATATGAGATGCCTGTGGATGTATTGGCCCGATGGATTGCAGACAAATCACAAGTCTATACTCAACACGCTTATATGAGGCCTCTTGGAGTAGCTGCTATCATTTTGGGTATTGATGAAGAAAATGGGCCTCAGCTCTTCAAGTGTGACCCAGCTGGCCATTTTTTTGGACACAAGGCTACAAGTGCTGGCTTAAAAGAACAAGAGGCAATTAATTTCTTGGAGAAGAAAATGAAGAATGACCCTGCATTCTCCTATGAGGAAACTGTACAGACTGCAATTTCTGCTCTGCAATCAGTTCTACAAGAGGACTTCAAGGTCAATGAGATTGAGGTAGGAGTTGTAGGACAAGAGAGCCGTGTCTTCAGAATCCTGTCGACTGAGGAGATTGATGAGCATTTGACGGCCATAAGCGAGCGTGATTAA
Protein:  
MSRGAGAGYDRHITIFSPEGRLFQVEYAFKAVKASGITSIGVRGKDTVCVVTQKKVPDKLLDQTSVTHLFPITKYLGLLATGMTADARTLVQQARSEAAEFRFRYGYEMPVDVLARWIADKSQVYTQHAYMRPLGVAAIILGIDEENGPQLFKCDPAGHFFGHKATSAGLKEQEAINFLEKKMKNDPAFSYEETVQTAISALQSVLQEDFKVNEIEVGVVGQESRVFRILSTEEIDEHLTAISERD
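
The CDS and protein sequences above can be checked against each other by translating the nucleotide sequence. The following is a minimal sketch that assumes the standard genetic code (the usual choice for a plant nuclear gene, though the record does not state a translation table); the `translate` helper is hypothetical and is shown here only on the first 30 and last 12 bases of the CDS.

```python
# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


head = "ATGAGTCGTGGAGCTGGAGCTGGCTACGAT"  # first 30 bases of the CDS
tail = "GAGCGTGATTAA"                    # last 12 bases of the CDS

print(translate(head))  # MSRGAGAGYD
print(translate(tail))  # ERD
```

Both fragments agree with the start (`MSRGAGAGYD...`) and end (`...SERD`) of the protein sequence listed in this record. Passing the full CDS string to `translate` would allow the same comparison over all 246 residues.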